AITopics | vt 1

Collaborating Authors

vt 1

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Polyak-Ruppert Central Limit Theorem for SA-Adam with Momentum and Non-Convergent Adaptive Preconditioning

An, Sunyoung, Huo, Xiaoming

arXiv.org Machine Learning Jun-17-2026

Adaptive optimizers combining preconditioning, momentum, and weight decay (Adam and AdamW) are, under Polyak-Ruppert averaging, candidate engines for one-pass inference. Does the averaged iterate keep the classical Polyak-Ruppert central limit theorem (CLT), with sandwich covariance $H^{-1}SH^{-1}$ (Hessian $H$, gradient covariance $S$), under momentum and non-convergent preconditioning? The preconditioner-only analysis does not carry over: with momentum the canonical decomposition collapses to a tautology. Treating the augmented state (iterate, momentum buffer) as a time-varying linear stochastic approximation (SA), we prove (under local stabilization) positive drift stability, a non-autonomous Polyak-Ruppert CLT, and a projection identity. The upshot: the iterate-marginal covariance is exactly the plain stochastic gradient descent (SGD) sandwich $H^{-1}SH^{-1}$, so the adaptivity is asymptotically invisible. This holds for SA-Adam (sub-linearly vanishing momentum gain, $γ\in(α,1)$; the sub-linear regime is essential), not constant-$β$ deployed Adam. Coupled $L_2$ weight decay yields the ridge-penalized sandwich, extending one-pass inference to regularized problems.
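
As a rough illustration of the sandwich covariance $H^{-1}SH^{-1}$ named in the abstract, the sketch below evaluates it for a small 2x2 case. The matrices `H` and `S` and the helper names `inv2`, `matmul2`, and `sandwich` are hypothetical and chosen only for this example; they are not taken from the paper.

```python
# Minimal sketch: evaluate the sandwich covariance H^{-1} S H^{-1} for 2x2 inputs.
# H (Hessian) and S (gradient covariance) below are made-up illustrative values.

def inv2(m):
    """Invert a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul2(x, y):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def sandwich(H, S):
    """Return H^{-1} S H^{-1}."""
    H_inv = inv2(H)
    return matmul2(matmul2(H_inv, S), H_inv)


H = [[2.0, 0.0], [0.0, 4.0]]   # assumed Hessian at the optimum
S = [[1.0, 0.5], [0.5, 1.0]]   # assumed gradient covariance at the optimum
V = sandwich(H, S)
print(V)
```

With a diagonal `H`, each entry of the result is the matching entry of `S` scaled by the two inverse curvatures, so directions with larger curvature get smaller asymptotic variance.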

artificial intelligence, machine learning, sa-adam, (18 more...)

arXiv.org Machine Learning

2606.17364

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)

Active Learning of Classifiers with Label and Seed Queries

Neural Information Processing Systems Apr-27-2026, 16:22:23 GMT

We study exact active learning of binary and multiclass classifiers with margin. Given an $n$-point set $X \subset \mathbb{R}^m$, we want to learn an unknown classifier on $X$ whose classes have finite strong convex hull margin, a new notion extending the SVM margin.

artificial intelligence, machine learning, query, (14 more...)

Neural Information Processing Systems

Country: Europe > Italy (0.14)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Computational Learning Theory (0.46)

bb57db42f77807a9c5823bd8c2d9aaef-Paper.pdf

Neural Information Processing Systems Feb-10-2026, 22:17:32 GMT

We study the reinforcement learning problem for discounted Markov Decision Processes (MDPs) under the tabular setting.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Country:

North America > United States (0.05)
Europe > United Kingdom > England (0.04)
Asia > Middle East > Jordan (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.34)

Appendices Contents Appendices 18

Neural Information Processing Systems Feb-10-2026, 12:01:13 GMT

Diplomacy is a complex environment, where training requires significant time. The action is an allocation of the player's coins across the fields: the player decides how many of its $c$ coins to put in each of the fields, choosing $c_1, c_2, \ldots, c_f$ where $\sum^{f}$ … Finally, Blotto is a single-turn (i.e.

artificial intelligence, hpi, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Spain (0.05)
Europe > Russia (0.04)
Europe > Portugal (0.04)
Asia > Russia (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.93)

7 Supplementary Material

Neural Information Processing Systems Feb-9-2026, 20:46:21 GMT

For all datasets, the same model and training hyper-parameters (except learning rate) were used.

artificial intelligence, machine learning, mt 1, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

R<. Then, in Algorithm 1, we have E[δT] 2L0R2

Neural Information Processing Systems Feb-9-2026, 11:38:51 GMT

Again, we suppose $p_t, u_1, \ldots, u_q$ are obtained from the process in Section B.3.3.

artificial intelligence, qd 1, vt 2, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.46)

Prior-independent Dynamic Auctions for a Value-maximizing Buyer

Neural Information Processing Systems Feb-9-2026, 09:05:04 GMT

Automatic bidding has become one of the main options for advertisers to buy advertisement opportunities in the online advertising market [Dolan, 2020]. The prevalence of automatic bidding is partly driven by the fact that it significantly simplifies the interaction between the advertisers and the advertising platform.

artificial intelligence, mechanism, optt, (16 more...)

Neural Information Processing Systems

Industry: Marketing (0.48)

Technology: Information Technology > Artificial Intelligence (0.93)

Supplementary Materials A Proof of Theorem 2: Asymptotic Convergence of Robust Q-Learning

Neural Information Processing Systems Feb-8-2026, 06:47:18 GMT

From [Borkar and Meyn, 2000], we know that the stochastic approximation (18) converges to the fixed point of T, i.e., Q . Finally, to show Theorem 3, we only need to show each term in (56) is smaller than . In this section we develop the finite-time analysis of the robust TDC algorithm. We note that recently there are several works [Srikant and Ying, 2019, Xu and Liang, 2021, Kaledin et al., 2020] on finite-time analysis of RL algorithms that do not need the projection. Specifically, the problem in [Srikant and Ying, 2019] is for one time scale linear stochastic approximation.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.34)

Fully Unconstrained Online Learning

Neural Information Processing Systems Feb-8-2026, 05:45:13 GMT

We provide a technique for online convex optimization that obtains regret $G\|w_\star\|\sqrt{T\log(\|w_\star\|G\sqrt{T})} + \|w_\star\|^2 + G^2$ on $G$-Lipschitz losses for any comparison point $w_\star$ without knowing either $G$ or $\|w_\star\|$.

artificial intelligence, machine learning, proof, (18 more...)

Neural Information Processing Systems

Country:

North America > United States (0.04)
Europe > United Kingdom (0.04)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Adaptive Federated Optimization

Reddi, Sashank, Charles, Zachary, Zaheer, Manzil, Garrett, Zachary, Rush, Keith, Konečný, Jakub, Kumar, Sanjiv, McMahan, H. Brendan

arXiv.org Machine Learning Feb-29-2020

Federated learning is a distributed machine learning paradigm in which a large number of clients coordinate with a central server to learn a model without sharing their own training data. Due to the heterogeneity of the client datasets, standard federated optimization methods such as Federated Averaging (FedAvg) are often difficult to tune and exhibit unfavorable convergence behavior. In non-federated settings, adaptive optimization methods have had notable success in combating such issues. In this work, we propose federated versions of adaptive optimizers, including Adagrad, Adam, and Yogi, and analyze their convergence in the presence of heterogeneous data for general nonconvex settings. Our results highlight the interplay between client heterogeneity and communication efficiency. We also perform extensive experiments on these methods and show that the use of adaptive optimizers can significantly improve the performance of federated learning.
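
The abstract does not spell out the update rule, so the following is only a minimal sketch of what a federated adaptive optimizer could look like, assuming an Adam-style server step applied to the average of client model deltas. The function name `fedadam_server_step`, the hyperparameter names and defaults (`lr`, `beta1`, `beta2`, `tau`), and the toy inputs are all illustrative assumptions, not the authors' reference implementation.

```python
import math


def fedadam_server_step(x, client_deltas, m, v,
                        lr=0.1, beta1=0.9, beta2=0.99, tau=1e-3):
    """One assumed Adam-style server update from averaged client deltas.

    x             -- current server model (list of floats)
    client_deltas -- per-client (local model - server model) vectors
    m, v          -- server-side first and second moment estimates
    """
    n = len(client_deltas)
    # Average the client deltas; the server treats this as a pseudo-gradient.
    delta = [sum(d[i] for d in client_deltas) / n for i in range(len(x))]
    # Exponential moving averages of the pseudo-gradient and its square.
    m = [beta1 * mi + (1 - beta1) * di for mi, di in zip(m, delta)]
    v = [beta2 * vi + (1 - beta2) * di * di for vi, di in zip(v, delta)]
    # Per-coordinate adaptive step; tau keeps the denominator away from zero.
    x = [xi + lr * mi / (math.sqrt(vi) + tau) for xi, mi, vi in zip(x, m, v)]
    return x, m, v


# Toy round with two clients and a two-parameter model (made-up numbers).
x0, m0, v0 = [0.0, 0.0], [0.0, 0.0], [0.0, 0.0]
deltas = [[1.0, -1.0], [3.0, -1.0]]
x1, m1, v1 = fedadam_server_step(x0, deltas, m0, v0)
print(x1)
```

In this sketch the clients would run ordinary local training and only the server step is adaptive; swapping the `v` update for a different rule is where Adagrad- or Yogi-style variants would differ.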

dataset, fedavg, optimizer, (13 more...)

arXiv.org Machine Learning

2003.00295

Country: